Add support for OCI Generative AI Chat Completions #94

speglich · 2025-06-23T19:23:49Z

This PR introduces a new OCIGenAIChatCompletionsClient to the llmperf framework, enabling benchmark tests using Oracle Cloud Infrastructure's (OCI) Generative AI chat completions endpoints.

The integration leverages the ChatOCIGenAI client from the langchain_community package to interface with OCI’s streaming chat API, supporting metrics collection such as:

Time to First Token (TTFT)
Inter-token latency
End-to-end latency
Output throughput
Token counts (input/output)
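
For readers unfamiliar with how these metrics relate to each other, here is a minimal sketch of deriving them from a streamed response. It is not the code in this PR: `StreamMetrics` and `measure_stream` are hypothetical names, each non-empty chunk is counted as one token (a real benchmark would likely use a tokenizer), and the clock is injectable so the arithmetic can be checked deterministically.

```python
import time
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class StreamMetrics:
    ttft_s: float                 # time from request start to first chunk
    inter_token_latency_s: float  # mean gap between consecutive chunks
    e2e_latency_s: float          # time from request start to end of stream
    output_tokens: int            # ASSUMPTION: one non-empty chunk == one token
    output_throughput_tps: float  # output tokens per second over the whole request


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.monotonic,
) -> StreamMetrics:
    """Consume a stream of text chunks and record arrival times."""
    start = clock()
    arrivals: List[float] = []
    for chunk in chunks:
        if chunk:  # ignore empty keep-alive chunks
            arrivals.append(clock())
    end = clock()

    e2e = end - start
    n = len(arrivals)
    ttft = arrivals[0] - start if n else 0.0
    gaps = [later - earlier for earlier, later in zip(arrivals, arrivals[1:])]
    itl = sum(gaps) / len(gaps) if gaps else 0.0
    throughput = n / e2e if e2e > 0 else 0.0
    return StreamMetrics(ttft, itl, e2e, n, throughput)
```

In practice `chunks` would be the text pieces yielded by the streaming client; passing a fake `clock` makes the function easy to unit test without network access.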

The new client reads necessary configuration from environment variables including OCI_COMPARTMENT_ID, OCI_AUTH_TYPE, OCI_CONFIG_PROFILE, OCI_ENDPOINT, OCI_MODEL_ID, and OCI_PROVIDER.
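
As an illustration of this configuration step, the sketch below collects the six variables into one object and fails early when any is unset. `OCIGenAIConfig`, `load_oci_config`, and `to_client_kwargs` are hypothetical helpers, not names from this PR. Treating every variable as required is an assumption, and so is the mapping onto `ChatOCIGenAI` keyword arguments, which should be checked against the installed `langchain_community` version.

```python
import os
from dataclasses import dataclass
from typing import Dict, Mapping

# Variable names are taken from the PR description.
# ASSUMPTION: all six must be set; the PR may supply defaults for some.
REQUIRED_VARS = (
    "OCI_COMPARTMENT_ID",
    "OCI_AUTH_TYPE",
    "OCI_CONFIG_PROFILE",
    "OCI_ENDPOINT",
    "OCI_MODEL_ID",
    "OCI_PROVIDER",
)


@dataclass(frozen=True)
class OCIGenAIConfig:
    compartment_id: str
    auth_type: str
    config_profile: str
    endpoint: str
    model_id: str
    provider: str

    def to_client_kwargs(self) -> Dict[str, str]:
        # ASSUMPTION: keyword names as accepted by ChatOCIGenAI; verify locally.
        return {
            "model_id": self.model_id,
            "service_endpoint": self.endpoint,
            "compartment_id": self.compartment_id,
            "auth_type": self.auth_type,
            "auth_profile": self.config_profile,
            "provider": self.provider,
        }


def load_oci_config(env: Mapping[str, str] = os.environ) -> OCIGenAIConfig:
    """Read the OCI settings from `env`, raising if any are missing or empty."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("Missing environment variables: " + ", ".join(missing))
    return OCIGenAIConfig(
        compartment_id=env["OCI_COMPARTMENT_ID"],
        auth_type=env["OCI_AUTH_TYPE"],
        config_profile=env["OCI_CONFIG_PROFILE"],
        endpoint=env["OCI_ENDPOINT"],
        model_id=env["OCI_MODEL_ID"],
        provider=env["OCI_PROVIDER"],
    )
```

Validating everything up front means a misconfigured benchmark run stops with one clear message instead of failing partway through the first request.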

This addition allows users to include OCI Generative AI in their LLM benchmark suites alongside other providers, improving coverage and flexibility in performance evaluations.

speglich added 2 commits June 23, 2025 19:11

feature: add OCIGenAIChatCompletions client

5494524

fix: add missing oci dependency

fe02b44
